Bayesian parameter estimation for automatic annotation of gene functions using observational data and phylogenetic trees

Abstract

Gene function annotation is important for a variety of downstream analyses of genetic data. But experimental characterization of function remains costly and slow, making computational prediction an important endeavor. Phylogenetic approaches to prediction have been developed, but implementation of a practical Bayesian framework for parameter estimation remains an outstanding challenge. We developed a computationally efficient model of the evolution of gene annotations using phylogenies, based on Markov Chain Monte Carlo for parameter estimation. Unlike previous approaches, our method is able to estimate parameters over many different phylogenetic trees and functions. The resulting parameter estimates agree with biological intuition, such as the increased probability of function change following gene duplication. The method performs well on leave-one-out cross-validation, and we further validated some of the predictions in the scientific literature.
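As a rough illustration of the kind of computation the abstract describes, the sketch below estimates gain and loss probabilities for a single binary annotation on one small tree, using a pruning-style likelihood and a random-walk Metropolis sampler. It is a simplified stand-in, not the authors' model: the tree, the leaf annotations, the uniform priors, the fixed root probability, and every name (`TREE`, `LEAVES`, `partial`, `log_likelihood`, `metropolis`) are assumptions made for this example.

```python
import math
import random

# Toy tree (hypothetical): each internal node maps to its two children.
TREE = {"root": ("a", "b"), "a": ("g1", "g2"), "b": ("g3", "g4")}
# Hypothetical leaf annotations: 1 = has the function, 0 = lacks it.
LEAVES = {"g1": 1, "g2": 1, "g3": 0, "g4": 0}


def partial(node, gain, loss):
    """Return [P(data below node | state 0), P(data below node | state 1)]."""
    if node in LEAVES:
        return [1.0 if LEAVES[node] == s else 0.0 for s in (0, 1)]
    # Per-branch transition matrix: trans[parent_state][child_state].
    trans = [[1.0 - gain, gain], [loss, 1.0 - loss]]
    out = [1.0, 1.0]
    for child in TREE[node]:
        below = partial(child, gain, loss)
        for s in (0, 1):
            out[s] *= trans[s][0] * below[0] + trans[s][1] * below[1]
    return out


def log_likelihood(gain, loss, root_prob=0.5):
    """Log-probability of the leaf annotations, summing over the root state."""
    p0, p1 = partial("root", gain, loss)
    return math.log((1.0 - root_prob) * p0 + root_prob * p1)


def metropolis(n_steps=5000, step=0.1, seed=1):
    """Random-walk Metropolis over (gain, loss) with uniform priors on (0, 1)."""
    rng = random.Random(seed)
    current = (0.5, 0.5)
    current_ll = log_likelihood(*current)
    samples = []
    for _ in range(n_steps):
        proposal = tuple(x + rng.uniform(-step, step) for x in current)
        # Proposals outside (0, 1) have zero prior mass and are rejected.
        if all(0.0 < x < 1.0 for x in proposal):
            proposal_ll = log_likelihood(*proposal)
            accept_prob = math.exp(min(0.0, proposal_ll - current_ll))
            if rng.random() < accept_prob:
                current, current_ll = proposal, proposal_ll
        samples.append(current)
    return samples
```

Calling `metropolis()` returns a list of `(gain, loss)` pairs. Under these toy assumptions, averaging the samples after discarding an initial burn-in gives posterior mean estimates. Pooling several trees, as the abstract describes, would amount to summing their log-likelihoods before the accept step.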

Related articles

Automatic estimation of regularization parameter by active constraint balancing method for 3D inversion of gravity data

Gravity data inversion is one of the important steps in the interpretation of practical gravity data. The inversion result can be obtained by minimization of the Tikhonov objective function. The determination of an optimal regularization parameter is highly important in gravity data inversion. In this work, an attempt was made to use the active constraint balancing (ACB) method to select the...

Bayesian Estimation of Shift Point in Shape Parameter of Inverse Gaussian Distribution Under Different Loss Functions

In this paper, a Bayesian approach is proposed for shift point detection in an inverse Gaussian distribution. The mean parameter of the inverse Gaussian distribution is assumed to be constant, and shift points in the shape parameter are considered. First, the posterior distribution of the shape parameter is obtained. Then the Bayes estimators are derived under a class of priors and using variou...

Bayesian Models for Phylogenetic trees

Introduction: Inferring the genetic ancestry of different species is a current challenge in phylogenetics because of the immense volume of raw biological data to be analyzed. Computational techniques are necessary to parse and analyze all of this data efficiently and accurately, with many algorithms based on statistical principles designed to provide a best estimate of a phylogenetic topology....

Bayesian estimation of concordance among gene trees.

Multigene sequence data have great potential for elucidating important and interesting evolutionary processes, but statistical methods for extracting information from such data remain limited. Although various biological processes may cause different genes to have different genealogical histories (and hence different tree topologies), we also may expect that the number of distinct topologies am...

Fuzzy Neighbor Voting for Automatic Image Annotation

With the rapid development of digital images and the availability of imaging tools, massive amounts of images are created. Therefore, efficient management and suitable retrieval, especially by computers, is one of the most challenging fields in image processing. Automatic image annotation (AIA) refers to attaching words, keywords or comments to an image or to a selected part of it. In this paper,...

Journal

Journal title: PLOS Computational Biology

Year: 2021

ISSN: 1553-734X, 1553-7358

DOI: https://doi.org/10.1371/journal.pcbi.1007948